There are 28,027 genes
also in SQLShare
Can get lengths
SELECT CGI_ID,len(sequence) as CDS_length FROM [sr320@washington.edu].[qDOD_Cgigas_gene_fasta]
--------------------
Now want to get genomic structure of gene..
GFF has Start on Stop and presumably includes introns….
Reconfigured to get ID out
Now lets get the corresponding fasta (again)
missing ID is in GFF.
---
no idea what is going on
CGI_10006842
is in fasta
and gff
Missing in SQLShare
Downloading to desktop and look at in TextWrangler.
In Short
CGI_10006842
---